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ABSTRACT 

The J-binding protein 1 (JBP1) is essential for bio- 
synthesis and maintenance of DNA base-J (|5-d- 
glucosyl-hydroxymethyluracii). Base-J and JBP1 
are confined to some pathogenic protozoa and are 
absent from higher eukaryotes, prokaryotes and 
viruses. We show that JBP1 recognizes 
J-containing DNA (J-DNA) through a 160-residue 
domain, DB-JBP1, with 10000-fold preference over 
normal DNA. The crystal structure of DB-JBP1 
revealed a helix-turn-helix variant fold, a 'helical 
bouquet' with a 'ribbon' helix encompassing the 
amino acids responsible for DNA binding. Mutation 
of a single residue (Asp525) in the ribbon helix 
abrogates specificity toward J-DNA. The same 
mutation renders JBP1 unable to rescue the 
targeted deletion of endogenous JBP1 genes in 
Leishmania and changes its distribution in the 
nucleus. Based on mutational analysis and 
hydrogen/deuterium-exchange mass-spectrometry 
data, a model of JBP1 bound to J-DNA was con- 
structed and validated by small-angle X-ray scatter- 
ing data. Our results open new possibilities for 
targeted prevention of J-DNA recognition as a 
therapeutic intervention for parasitic diseases. 



INTRODUCTION 

Base J (1), (3-D-glucosyl-hydroxymethyluracil, is a 
hypermodified base present in all kinetoplastid flagellates 
studied (2), including the Trypanosoma, Leishmania and 
Crithidia genera and in Euglena (3), but absent from 
other eukaryotes, prokaryotes and viruses [reviewed in 
(4,5)]. J is a minor base replacing about 0.5% of T in 
the nuclear DNA of kinetoplastida (6,7) and is mainly 
present in the telomeric repeat sequence (GGGTTA) n 
(2,8). Small amounts of J are also found in other repetitive 
sequences of Trypanosoma brucei, such as the expression 
sites of variant surface glycoprotein (VSG) genes (9) and 
in sequences between transcription units (10). 

Synthesis of the J nucleotide (11) and J-base containing 
(J-DNA) oligonucleotides by chemical methods (12,13), 
allowed the identification of a 93kDa J-binding protein 
1 (JBP1) in extracts of T. brucei, Leishmania species and 
Crithidia fasciculata (14). JBP1 binds specifically to base 
J in duplex DNA (15). JBP1 is essential in Leishmania 
(16), but not in T. brucei. The absence of JBP1 in 
T. brucei has no effect on growth, DNA repeat stability 
or gene expression, but does result in a 20-fold decrease in 
J level relative to wild-type cells (17). This led to the hy- 
pothesis that JBP1 catalyzes the first and rate-limiting step 
in J biosynthesis, the hydroxylation of T in DNA. 
This hypothesis was supported by the identification of 
a weak sequence similarity between JBP1 and Fe 2+ and 
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2-oxoglutarate (2-OG)-dependent hydroxylases 
(dioxygenases) (18). Indeed, replacement of each of the 
four amino acids known to be essential for hydroxylase 
activity resulted in mutant proteins unable to complement 
JBP1 function in either T. brucei or Leishmania, but still 
able to bind to J-DNA. This showed that thymidine 
hydroxylase activity and J-DNA binding are independent 
functions of JBP1 (18,19). 

There is one other protein in kinetoplastid flagellates 
which partially shares sequence similarity with JBP1, 
J-binding protein 2 (JBP2). JBP1 and JBP2 share 34% 
identity in their N-terminal halves (20), which contains 
the thymidine hydroxylase function of JBP1 (18) and of 
JBP2 (21,22). The C-terminal half of JBP2, but not of 
JBP1, contains a region similar to proteins with SWI2/ 
SNF2-like chromatin remodeling activity. Although 
JBP2 contributes to J biosynthesis (10,21,22), it is not es- 
sential in T. brucei or in Leishmania and there is no 
evidence that JBP2 can bind to J-DNA (20). 

Although JBP1 and JBP2 are unique proteins, a distant 
homolog of the JBP1/2 hydroxylase domain was recently 
identified in the mammalian protein TET1 (23,24), a 
fusion partner of the MLL gene in acute myeloid 
leukemia. TET1 and the related TET2 and TET3 
proteins catalyze the conversion of 5-methylcytosine in 
DNA to 5-hydroxymethylcytosine, a reaction that may 
play an important role in the epigenetic control of gene 
expression (23). JBP and TET proteins have been grouped 
together in the TET/JBP subfamily of dioxygenases (25). 

The binding of JBP1 to J-DNA has been studied by 
various methods. Using J-containing duplex oligonucleo- 
tides in a gel retardation assay, it was shown that JBP1 
binds to J-containing oligonucleotides with an affinity 
between 40 and 140 nM (15). A fluorescence anisotropy 
polarization assay (FP) (26) yielded affinities as low as 
13 nM. Binding to J-DNA is highly specific, since compe- 
tition assays using gel-retardation indicated that a 
500-fold excess of T-DNA could not out-compete 
J-specific DNA binding (27). The FP assay showed an 
affinity for T-DNA ~ 100 times lower than that for 
J-DNA (1370 nM compared to 13 nM). The mode of inter- 
action of JBP1 with J-DNA has been probed with several 
biochemical methods (15,27). Substitution of the 
hydroxymethylU in the J-base by hydroxymethylC, 
resulted in a 17-fold decrease in J-binding, showing that 
the pyrimidine base to which the glucose is attached 
co-determines binding affinity. At least 5 bp on both 
sides of J-base are required for optimal binding of JBP 1, 
although critical contacts are restricted to two bases: 
major and minor groove contacts with base J and a 
sequence-independent major groove contact with the 
base immediately 5' of base J on the same strand 
(position J-l) (27). Subsequent studies in which the 
sugar moiety of base J was systematically varied, have 
suggested a specific role for nucleotide J-l: its 
non-bridging phosphoryl oxygen hydrogen bonding to 
the equatorial 2- and 3-hydroxyl groups of the pyranosyl 
ring of the glucose of base J and locking the glucose in an 
'edge-on' conformation perpendicular to the plane of the 
major DNA groove (26). 



To determine exactly how JBP1 binds to J-DNA, 
three-dimensional molecular structures are required. As 
the first step toward this goal, we have identified a 
160-residue autonomous folding unit (domain), 
DNA-Binding JBP1 domain (DB-JBP1), which binds to 
J-DNA with approximately the same affinity and specifi- 
city as full-length JBP1. We have determined the crystal 
structure of DB-JBP1, revealing a novel 'helical bouquet' 
fold and suggesting a model for the interaction with 
J-DNA. Hydrogen/deuterium exchange (HDX) mass 
spectrometry (MS) experiments probing the interaction 
between full length JBP-1 and J-DNA, showed that all 
significant changes upon binding of J-DNA were all 
within specific regions of the DB-JBP1 domain only. 
With site-directed mutagenesis and binding studies, we 
show that a single aspartate residue in JBP1 is essential 
for specific J-base recognition in vitro and for the function 
of JBP1 in vivo. Finally, based on all the data above, we 
constructed a structural model for the binding of 
DB-JBP1 to J-DNA, which was confirmed by small 
angle X-ray scattering (SAXS) measurements of the 
complex between JBP1 and J-DNA in solution. 

MATERIALS AND METHODS 

Expression and purification of recombinant proteins 

The JBP1 coding sequences from the Leishmania species 
tarentolae, major, aethiopica, donovani, infantum and from 
T. brucei and C. fasciculata were amplified by PCR 
methods using Pfu polymerase (Stratagene) and inserted 
by ligation independent cloning in our in-house pET 
vector derivative, containing a hexa-histidine tag and an 
HRV 3C protease site. Clones were tested for the produc- 
tion of soluble protein in a heterologous Escherichia coli 
expression system. While JBP1 from C. fasciculata 
(C/-JBP1) was obtained in large amounts, as expected 
from previous experience (15), the only other JBP1 made 
in E. coli in soluble form was the one from Leishmania 
tarentolae (L?-JBP1) from a codon optimized gene. 

For large-scale preparations, vectors were transformed 
to B121(DE3)T1 R cells and grown in the presence of kana- 
mycin. Protein production was induced with 0.3 mM 
IPTG at 15°C at an optical density of -0.8 for 16-18 h. 
Cell pellets were obtained by centrifugation at 4000 rpm 
and lysed with the pneumatic homogenizer EmulsiFlex-C5 
in 20 mM Hepes pH 7.5, 350 mM NaCl, 1 mM TCEP, 
10 mM Imidazole (Buffer A), plus a tablet of Complete 
EDTA-free (Roche) at 4°C. After clearing the supernatant 
by centrifugation, the proteins were bound in batch to 
Ni-chelating sepharose beads for 30min at 4°C (GE 
Healthcare), washed with Buffer A and eluted in Buffer 
A containing 400 mM Imidazole. Fractions were analyzed 
by SDS PAGE. 

The JBP1 elution fractions were cleaved with 3C 
protease overnight at 4°C and applied to a S75 16/60 gel 
filtration column. Yields were between 5 and 50mg/l of 
culture, depending on the construct. All proteins were 
>95% pure as judged by SDS-PAGE. 

All truncation constructs were amplified by PCR using 
Pfu polymerase (Stratagene) and cloned for expression in 
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the same vector, allowing cleavable His-Tag expression. 
Site-specific mutants were created using the 
QuickChange (Stratagene). Expression and purification 
experiments were carried out as for full-length JBP1. 

Multi-angle laser light scattering 

Multi-angle laser light scattering (MALLS) experiments 
were performed in a Superdex 200 HR 10/30 column 
attached to an Akta FPLC and coupled to a 
miniDAWN light scattering detector (Wyatt 
Technology) and a Dn-1000 differential refractive index 
detector (WGE Dr Bures). Hundred microliter of 
purified L/-JBP1 at a concentration of ~5.0mg/ml were 
injected onto the column. Data analysis were carried out 
with the program Astra using a dn/dc value of 0.185. 

Analytical size exclusion chromatography 

Size exclusion chromatography experiments were per- 
formed with a Superdex 200 3.2/30 column attached to 
an Akta Purifier system from GE Healthcare. For 
C/-JBP1 alone, we diluted 10 ul of a ~50uM stock of 
protein (0.5nmol) in a total volume of 50 ul of buffer 
(20 mM Hepes/HCl pH 7.0 and 200 mM NaCl) to a con- 
centration of ~10 uM and injected that in the column. The 
elution volume (1.50 ml) indicated a size of approximately 
100 kDa. Subsequently, a J-containing oligonucleotide 
with the sequence TAACCCJAACCCCTA was annealed 
with its complementary strand and concentrated to about 
50 uM. A small amount of protein (1 ul of the protein 
stock above, ~0.1nmol) was mixed with a five times 
molar excess of the concentrated oligonucleotide (5 ul, 
~0.5nmol) in a total buffer volume of 50 ul as above 
and injected onto the column under identical conditions. 
The molar excess of oligonucleotide was used to ensure 
that all of JBP1 was bound to J-DNA. The elution volume 
of the JBP1: J-DNA complex (1.48 ml) was very close to 
that of JBP1 alone (1.50 ml). However, the shift in the 
absorbance at 260 and 280 nm wavelength, clearly 
indicated that in the JBPLJ-DNA mixture, the peak cor- 
responds to a protein:DNA complex. Using the theoretic- 
al molar extinction coefficients for JBP1 (e = 101 855) and 
for the oligonucleotide (e = 169 000), we calculate a stoi- 
chiometric ratio (28) of 1:1.01, that clearly indicated that 
C/ : JBP1 recognizes J-base containing DNA as a 
monomer. 

Fluorescence polarization anisotropy assays 

A single-stranded J-containing oligonucleotide with the 
sequence 5'-GGCAGCJGCAACAA-3' was synthesized 
as we previously described (26), together with its 
T-containing equivalent GGCAGCTGCAACAA. A 
common complementary oligonucleotide was synthesized 
labeled in the 5' with tetramethylrhodamine. Both comple- 
mentary pairs of oligonucleotides were dissolved in water 
and heat-annealed to yield J-DNA or T-DNA and the 
duplex DNA was purified over a Superdex-75 10/30 
Hi-Load gel filtration column. Dye concentration was 
determined through UV-VIS measurements and the 
double-stranded oligonucleotides were calibrated against 
the standard curve. The binding reaction was carried out 



with InM TAMRA-labeled DNA, 20 mM HEPES pH 
7.5, 140 mM NaCl (unless otherwise indicated for the 
salt dependence experiments), 1 g/1 chicken ovalbumin, 
2 mM MgCL: and 1 mM TCEP. The maximum possible 
amount of all JBP1 protein variants was added and sub- 
sequent dilutions were achieved by serial l:l-dilutions in 
three repeats. The reaction was incubated for lOmin at 
4°C and room temperature before the measurements. 
Measurements were performed on an En Vision 2101 
multilabel reader (Perkin Elmer) using 96-well optiplates 
(Perkin Elmer). The excitation filter was a Perkin Elmer 
X531 with a CWL of 531 nm, while the P and S emission 
filter were M579p with a CWL of 579 nm. All measure- 
ments were preformed at room temperature. 

Reverse footprinting 

C/ : JBP1 alone or mixed with J-DNA were prepared mug 
amounts and digested with several proteases and in 
various conditions. Fragments of the C/ : JBP1: J-DNA 
mixture resistant to proteolysis, which were not present 
when JBP1 alone was digested under identical conditions, 
were obtained using a ratio of trypsin:C/ : JBPl of 1:50 and 
an incubation of 30 min on ice. 

The digested mixture was applied to a 1 % agarose gel in 
TBE and run for 30 min at 100 V. Ethidium bromide and 
Coomassie blue were used to visualize the DNA and 
protein, respectively, in the same gel. The region of the 
agarose gel containing the protein bound to shifted DNA 
was isolated and the protein was eluted by melting the 
agarose gel slice under mild conditions. The eluate was 
size-fractionated by polyacrylamide gel electrophoresis in 
a 12% tris-glycine gel and transferred on a PVDF 
membrane in Tris-Glycine-SDS-Methanol buffer for 1 h 
at 100 V. The membrane was stained with Ponceau S and 
the band was cut out and sent for N-term sequencing; it 
showed the unique sequence (-)VTSSG (Cambridge 
Peptides, UK). 

Crystallization of DB-JBP1 

DB-JBP1 was crystallized using a robotic setup we have 
described previously (29). Clustered needle-like crystals 
grew at 4°C overnight in 20% w/v PEG 8000, 0.2 M 
MgCl 2 and 0.1 M Tris pH 8.5, were crashed and used as 
seeds for a new screening round that gave bigger single 
crystals in 20% w/v PEG 3350 and 0.2 M potassium 
nitrate. These new crystals were used as seeds for a new 
screening round with selenomethionine substituted protein 
to obtain thick hexagonal plates with sharp edges in 
15-20% w/v PEG 6000 and 0.2 M potassium nitrate. 
These crystals were transferred into a solution containing 
mother liquor supplemented with 20% glycerol, mounted 
in litho-loops and vitrified in liquid nitrogen for diffrac- 
tion studies. Crystals from the various rounds of improve- 
ment are shown in Supplementary Figure SI. 

Crystallographic structure determination of DB-JBP1 

Crystallographic statistics are shown in Table 1. X-ray 
diffraction data were collected at a wavelength of 
0.978 A at the SLS beamline PXI, integrated with XDS 
(30) and scaled using SCALA (31). Positions of the 
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Table 1. Crystallographic data collection and refinement statistics 



Data collection 



Space group 


P6,22 


Cell dimensions 




a, b, c (A) 


67.34, 67.34, 186.76 


<*, A Y (°). 


90, 90, 120 


Resolution (A) 


20.0-1.90 (2.00-1.90) 


^merge 


0.090 (0.663) 


Ijal 


10.9 (2.3) 


Completeness (%) 


99.8 (98.3) 


Redundancy 


5.2 (5.4) 


Refinement 




Resolution (A) 


20.0-1.90 


No. reflections 


20 595 


-Kwork/ ^free 


0.176/0.200 


No. atoms 




Protein 


1368 


Ligand/Ion 


14 


Water 


163 


B-factors 




Protein 


43.5 


Ligand/ion 


57.8 


Water 


49.9 


RMSD 




Bond lengths (A) 


0.010 


Bond angles (°) 


1.27 


selenium atoms were 


identified with phenix.hyss (32) that 



picked up seven sites. Substructure refinement and com- 
pletion was performed in PHASER (33). Phase probabil- 
ity distributions were modified using DM; at this point the 
enantiomeric space group P6i22 was chosen as correct. 
Automated model building with ARP/wARP 7.0 (34) 
using the SAD likelihood refinement function (35) 
resulted in 136 residues distributed in two chains. The 
model was completed manually using COOT (36), 
interspersed with reciprocal space refinement cycles in 
REFMAC5 (37), using bulk solvent scaling, non-polar 
hydrogens and four TLS groups. The final model was 
validated using WHATCHECK (38) and the MolProbity 
(39) server and is of excellent quality without any 
Ramachandran outliers, ranking in the 95th best percent- 
ile of PDB structures in that resolution. The model is de- 
posited in the PDB with code 2XSE. 

Structure similarity searches 

Structure similarity searches were carried out with two 
methods. SSM searches (40) against the whole PDB, 
after changing default search values as to the server sug- 
gestions, returned some similar structures with low signifi- 
cance scores. The top-hit, however (3FDQ), displayed an 
RMSD between Ca-atoms of matched residues of 3.25 A 
over a length of 84 residues and a statistical significance 
Z-score of 1.3, despite a very low sequence identity of 6% 
in the aligned regions and a P-score (minus logarithm of 
the probability of achieving the same or better quality of 
match at a chance) of zero. All other SSM hits had 
Z-scores below 1.0. DALI searches, however (41), 
returned the same hit in the second position in the list 
ranked by Z-score, with a significance Z-score of 5.3 
and similar alignment metrics as SSM. All other DALI 
hits were helical proteins but with different connectivity 



than our query structure. Manual inspection of the SSM 
and DALI structural alignments clarified that both the 
MogR structure (3FDQ) and DB-JBP1 had identical con- 
nectivity in the core four helices and we thus considered 
the two domains as structural homologs. 

HDX MS 

Solution phase amide HDX experiments were performed 
with a LEAP technologies (Carroboro, NC, USA) CTC 
HTS twin PAL auto sampler interfaced with an 
electrospray ionization (ESI) linear ion trap mass spec- 
trometer (Thermo Electron, San Jose, CA, USA) (42). 
Of a 11 uM JBP1 stock solution (0.7M NaCl, 15mM 
KC1, pH 7.0), 4ul were diluted with 16 ul of an equivalent 
composition D 2 0 buffer and incubated for 1, 30, 60, 900 
and 3600 s to allow H/D exchange to occur. Following the 
allotted time period, the sample was diluted to 50 ul with 
cold (1.5°C) 2M urea containing 0.1% TFA. Protein was 
then passed through an immobilized pepsin column 
(2 mm x 2 cm made in-house) held at 1.5°C and the result- 
ant peptides were captured on a C8 peptide micro trap 
(Michrom Bioresources, Inc., Auburn, CA, USA). 
Peptides were then eluted across an HPLC column 
(Betasil C18 50 x 2.1 mm, 5 urn) into the ESI source with 
a gradient of 2% CH 3 CN increasing to 50% CH 3 CN over 
18min. For the JBP1+J-DNA experiment, lluM JBP1 
stock solution was incubated with a J-base containing syn- 
thetic oligonucleotide prior to mixing with D 2 0 buffer and 
subsequent analysis. Data represent the average of three 
replicates and were processed with in-house developed 
software (43). 

Surface plasmon resonance assays 

Surface plasmon resonance (SPR) spectroscopy was per- 
formed at 25°C on a Biacore T100. Oligonucleotides were 
prepared as described for the FP assays, but with a 
biotin-TEG labeled complementary strand, that was 
used to immobilize them on a streptavidin chip (SA, GE 
Healthcare). Concentrations series of all proteins were 
injected across the chip in a PBS buffer pH 7.4, with 
0.05% (w/v) Tween-20 at a flow rate of 30ul/min. The 
concentrations used were between 32 and 0.25 uM for all 
DB-JBP1 variants. Binding curves were recorded for each 
protein, using an empty flow cell as reference. All experi- 
ments were repeated at least twice in a non-sequential 
manner to exclude systematic errors. The Biacore T100 
evaluation software was used for the initial data analysis. 

Testing the function of the JBP1 D525A mutant in vivo 

JBP1 is essential in L. tarentolae for making J-base (16) 
and this complicates the testing of mutant versions of 
JBP1. We have used the procedure that was described in 
(18) to test the hydroxylation activity of JBP1 in a rescue 
experiment. The principle of this experiment is schematic- 
ally outlined in Supplementary Figure S2. One of the two 
alleles of the JBP1 gene in L. tarentolae was disrupted first 
with a JBP1 KO puromycin construct. The resulting 
JBPl +/ ~ cells were selected with puromycin and subse- 
quently transfected with plasmids containing either the 
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wild-type JBP1 or JBP1 D525A mutant. Transfected cells 
were then selected with paromomycin, as these plasmids 
also contain a neomycin marker. Next, the second JBP1 
allele was targeted with a JBP1 KO hygromycin construct. 
JBPr 1 ' cells were then selected with puromycin and 
hygromycin and paromomycin was added to ensure reten- 
tion of the rescue plasmid containing either wild-type or 
mutant JBP1. 

Numerous colonies were obtained with the cells con- 
taining the rescue plasmid with wild-type JBP1. Ncol re- 
striction digests of the DNA of some of these colonies 
were hybridized with a radioactive probe, corresponding 
to the 3'-UTR of the JBP1 gene that is not present in the 
rescue gene of the plasmids. The blots were used to 
identify the endogenous JBP1 gene or the rescue con- 
structs as outlined in Supplementary Figure S3. In six 
out of the eight clones, the band corresponding to the 
endogenous JBP1 gene was not present (Supplementary 
Figure S3), indicating a complete disruption of both JBP1 
alleles and has a reasonable efficiency of the rescue 
plasmid. Only few clones were obtained when the JBP1 
D525A rescue plasmid was used for rescue, already 
indicating that this mutant is inefficient in rescuing the 
endogenous JBP1 gene disruption. All of these clones 
had retained a wild-type JBP1 gene band, as shown in 
Supplementary Figure S3 (lanes 11-18); duplication of 
the endogenous wild-type JBP1 gene, is an expected 
reaction of Leishmania, as reported (16,18). 

To verify that JBP1 protein was actually produced in 
sufficient amounts in the transfected cells, we quantified 
the amount of JBP1 by western blotting. Supplementary 
Figure S4 shows that some of the mutant rescue clones 
produced high amounts of JBP1. The localization of that 
protein was subsequently assessed by fluorescence micros- 
copy. For fluorescence microscopy, wild-type L. tarentolae 
and cells overproducing the D525A mutant protein were 
fixed on the same slide with 1% formaldehyde in PBS, 
blocked with lx blocking solution (Roche). Primary 
(a-JBPl) and secondary (Alexa 488 conjugated goat 
a-rabbit) antibodies were incubated in 3% BSA/PBS at 
37°C. Slides were mounted in Vectashield solution 
(Vector Laboratories). The DNA was stained with 
Topro3 DNA stain (invitrogen). Images were captured 
with a Leica TCS SP2 AOBS confocal microscope. 
Details of the procedure are described in (22); the 
a-JBPl antibody was described in Ref. (6). 

Modeling of the DB-JBP1 complex 

The complex of MogR with DNA (3FDQ) was used as the 
template to create a model of the binding of DB-JBP1 to 
J-DNA. First, the recognition helix of DB-JBP1 (517-525) 
was superposed to the recognition helix of MogR using 
COOT (36). Then, a model for J-DNA was created as a 
standard 20-mer B-DNA with the sequence used for the 
SAXS experiment. A 3 bp fragment of J-DNA from the 
molecular dynamics simulations that suggested the specific 
conformation of J-base in the major groove (26) was then 
grafted to the 20-mer DNA to create a J-DNA 20-mer, 
with J at position 12. The geometry of that structure was 
idealized in REFMAC5 (37) using interval restraints (44) 



for describing the hydrogen bonds of the J-base to the 
7V-l-phosphoryl oxygen and between all other Watson- 
Crick base pairs. The 7nt bracketing the J-base in the 
J-containing strand (9-15) of this idealized 20-mer were 
then superposed to the DNA in the MogR complex, 
'sliding' it in various positions to find a placement that 
would allow the J-base to be located close to Asp525 of 
the recognition helix; a unique suitable solution was found 
and was used to create the DB-JBP1: J-DNA complex. In 
that complex, both C4 and C6 hydroxyls were very close 
to Asp525, whereas Arg517, Lys522, Lys524 were all close 
to DNA and available for interactions. That encouraged 
us to model the loop missing from the structure, residues 
529-534, in a conformation where Arg532 was also in 
contact with DNA; a suitable stereo-chemically allowed 
conformation was easily identified using COOT. The 
final model geometry was idealized with REFMAC5 and 
was then compared against the SAXS experimental data 
and showed an excellent fit. 

SAXS data collection and analysis 

The same DB-JBP1 protein as used for crystallization was 
also used for the SAXS experiments. DB-JBP1 and double 
stranded J-DNA oligonucleotides with the sequence 5'-CA 
GAAGGCAGCJGCAACAAG-3' were created as 
described above, mixed at a 1:1 molar ratio and the 
complex was purified by size exclusion chromatography. 
Two concentrations of DB-JBP1 alone (1.48 and 2.58 mg/ 
ml) and three concentrations of the complex between 
DB-JBP1 and J-DNA (2.1, 2.5 and 6mg/ml) were 
measured at the XI 3 beamline at the EMBL-Hamburg 
and all data were possessed by PRIMUS (45). The agree- 
ment between the experimental scattering curves and the 
structure of DB-JBP1, the model of DB-JBP1 bund to 
J-DNA and J-DNA alone were evaluated using 
CRYSOL (46). The ab initio model for the 
DB-JBP1: J-DNA complex was created by DAMMIF (47). 

RESULTS 

JBP1 binds to J-DNA ten thousand times stronger 
than to T-DNA 

We have produced large amounts of JBP1 from 
C. fasciculata (C/ : JBP1) (15) and from L. tarentolae 
(X/-JBP1) using a synthetic gene. Both L/-JBP1 and 
C/ : JBP1 were purified with standard chromatographic 
methods as described in Materials and Methods. The 
high quality proteins obtained were used for all subse- 
quent assays. Initial characterization experiments were 
performed with C/ : JBP1; most subsequent experiments 
were done with L/-JBP1. 

The molecular mass of both C/-JBP1 and Z^-JBP1 was 
determined by MALLS, showing that both are monomers 
in solution (Supplementary Figure S5A). JBP1 also 
binds to DNA as a monomer, as indicated by size exclu- 
sion chromatography experiments using C/ : JBP1 
(Supplementary Figure S5B). 

Protein-DNA binding is typically dominated by 
hydrogen bonding and electrostatic interactions, which 
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are salt-sensitive. We therefore tested the dependence of 
J-DNA binding affinity on salt concentration. For this 
assay, we used a FP setup, similar to the one described 
in (26). The binding of JBP1 to a DNA sequence derived 
from the VSG gene, containing a J or a T nucleotide, 
exhibits a strong dependence on salt concentration 
(Figure 1A-C). The maximal discrimination between 
J-DNA and T-DNA takes place around physiological 
salt conditions and is about ten thousand times, with a 
K D for J-DNA of 11.1 nM and for T-DNA of 83 uM 
(Figure 1C and Supplementary Table SI). 

A small domain of JBP1, DB-JBP1, is sufficient for 
specific J-DNA recognition 

To determine whether a specific domain of JBP1 is respon- 
sible for J-DNA recognition, we performed a 'reverse 
footprint' experiment. C/-JBP1 was mixed with J-DNA 
in microgram amounts and treated with limiting 
amounts of protease in an attempt to identify a protein 
domain that becomes resistance to proteolysis upon 
J-DNA binding. We found conditions in which 
J-DNA-bound JBP1 produced fragments resistant to pro- 
teolysis, which were not present when JBP1 alone was 
digested under identical conditions (Figure 2A). The 
digested mixture was used in a non-radioactive agarose 
gel retardation assay with J-containing oligonucleotide. 
We identified a clear band of shifted J-DNA 
(Figure 2B). This band migrated faster than the control 
J-DNA bound by full-length JBP1 and overlapped with a 
protein band (Figure 2B), suggesting that a small 
proteolysis-resistant fragment was responsible for the 
band shift. This fragment was identified by N-terminal 
protein sequencing to start at residue 382, while a com- 
bination of molecular weight information, secondary 
structure predictions and alignments allowed us to also 
define the most likely C-terminus at residue 561 
(Figure 2C and D). This domain was termed DNA 
Binding JBP1 domain (DB-JBP1). 



The DB-JBP1 domain was produced in E. coli and 
yielded large amounts of soluble protein. We compared 
the binding of the Leishmania DB-JBP1 and full-length 
LkJBPI to J-DNA using the fluorescent polarization 
assay (Supplementary Figure S6 and Table SI). 
DB-JBP1 binds to J-DNA with an affinity about three 
times less than full-length JBP1 (32.9 nM compared to 
11.1 nM) and to T-DNA with a marginally lower affinity 
(129 uM compared to 83 liM; Supplementary Table SI). 
The discrimination between J-DNA and T-DNA at 
physiological salt concentrations is about the same for 
both JBP1 and DB-JBP1 (Supplementary Table SI). 

The crystal structure of the DB-JBP1 domain 

The DB-JBP1 structure was determined using Se-Met 
substituted DB-JBP1 in a single-wavelength anomalous 
dispersion (SAD) experiment and refined to yield a 
model with all residues in favorable regions of the 
Ramachandran plot and an Rf ree of 20.3% (Table 1). 
The final model does not contain the first 13 residues 
(382-394) and six residues in the loop between residues 
528 and 535, which could not be modeled in the electron 
density and are presumably disordered. 

The structure of DB-JBP1 is a helical bundle in an ar- 
rangement we call a 'helical bouquet' (Figure 3A and B). 
The fold is made by five helices, of which the four longest, 
otl, a2, a3 and a5 run antiparallel in the same approximate 
orientation (the 'flowers' of the bouquet), with their 
axes at angles of about 15°^t5°. Helix or.4 is perpen- 
dicular to this arrangement, creating a 'ribbon' running 
across the front. The helical bouquet appears to be a 
divergent variant of the aberrant three-helical bundle 
helix-turn-helix (HTH) domains, closest to the prokaryot- 
ic tetra-helical bundle (Figure 3E) (48). The ribbon helix in 
the DB-JBP1 helical bouquet corresponds to the recogni- 
tion helix of the core three-helical bundle (a2^1) preceded 
N-terminally by a long 28-residue helix (al) and followed 
C-terminally by a 19-residue helix (oc5). Notably, the sup- 
porting a3 helix and the recognition a4 (ribbon) helix are 
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Figure 1. Specificity of JBP1 for J-DNA over T-DNA is salt dependent. (A and B) Binding of varying concentrations of Lr-JBPl to J-DNA (A) and 
T-DNA (B) measured by the anisotropy of the signal of 1 nM labeled DNA, for seven different (buffered) concentrations of NaCl; symbols indicate 
the data points and vertical bars, the measures standard deviation; lines represent the fitted function that determination of the allowed K D ; for very 
tight K D the fit is poorer since it is likely to be far from the assumed equilibrium. (C) The ratio of the calculated K D for T-DNA over J-DNA 
(indicating the specificity toward J-DNA) is shown as the function of the ionic strength of the buffer; the dotted line represents a polynomial fit to 
the data without assuming a physical model. 



Nucleic Acids Research, 2011, Vol. 39, No. 13 5721 



JBP1 JBPI+Trypsin 



100. 
75" 
50- 
37- 

25- 
20- 

14- 

c 




SDS-PAGE fractionation 

JBP1 
J-DNA 

JBP1 From Trypsin From 
J-DNA B1/B3 30' B2/B4 

M A4 A5 A6 A7 



B Agarose gel J-DNA retardation 



EtBr stain 
From A4 From A6 

B1 B2 



Coomassie stain 
From A4 From A6 

B3 B4 





DB-JBP1 






(1 62(~ Thymidine hydroxylase )264 382(DNA-binding (DB-JBP1)) 561 



~~m) 



D 

L. tarentolae 



tarentolae 
aethiopica 
braziliensis 
donovani 
infantum 



L 
L 
L 
L 
L 

L. major 

C. f asciculata 

T.brucei 

T.cruzei 



L. tarentolae 



SDGI 
SDGI 
SDGI 
SDGI 
SDGI 
SDGI 
SDGI 
ADGL 
DDGL 



PLRGE 
PLRGE 
PLRGD 
PLRGE 
PLRGE 
PLRGE 
PLRAA 
PR RAP 
PRREK 



DEKV 
ENKL 
D E KL 
EEKL 
E E KL 
DKKL 
EQKL 
EEI I 
EQLI 



KANGD 
KANSD 
KANGD 
KANSD 
KANSD 
KANSD 
KANAD 
PVHKH 
SVHLP 



3 8 0 

STPRPL 
SAPRPL 
TGAKPL 
SASRPL 
SASRPL 
SASRPL 
GASR3V 
TNVC . M 
VVVK . M 



ni a0 

disordered sumsuui JUUISL 

390 400 410 



a1 



SLQJIQJULQJISISJLQJIQJLQJI^^ 

420 430 440 



SRLGGFS 
SRLGGFS 
SRLGGFS 
SRLGGFS 
SRLGGFS 
SRLGGFS 
TSSGGFS 
HYLGGFS 
SPSGGFS 



ETNLM 
ETNLM 
ETDLM 
ETNLM 
ETNLM 
ETNLM 
ESDAV 
EKGDI 
ELGGA 



VSTAV 
VSTAV 
VSTAA 
VSTAV 
VSTAV 
VSTAV 
LTTAV 
LNEAM 
LKAAE 



EKKKY 
EKKKY 
EKRKY 
EKKKY 
EKKKY 
EKKKY 
EKSKY 
NKRHY 
EKQYF 



LDSE 
LNSE 
D 

LNSE 
LNSE 
LNSE 
LERD 
LDKE 
FEEK 



FLLHC 
FLSHF 
FLSHC 
FLSHF 
FLSHF 
FLSHF 
HLSQC 
NLQKM 
YLADE 



I SAQ 
I SAQ 
I SAQ 
I SAQ 
ISAQ 
I SAQ 
I SAE 
FGEE 
LGPD 



LLDMW 
LLDMW 
D W 
LLDMW 
LLDMW 
LLDMW 
LLAMW 
FVNIW 
LMSMW 



KQARA 
KQARG 
KQARA 
KQARG 
KQARG 
KQARG 
VEARK 
TQSRT 
QSRA 



RWLE 
KWLE 
RWLE 
KWLE 
KWLE 
KWLE 
HWLR 
HWLQ 
HWLR 



LVGK 
LVGR 
LVGK 
LVGR 
LVGR 
LVGR 
LVAT 
LVKK 
LVKE 



EWAH 
EWTH 
EWKH 
EWTH 
EWTH 
EWTH 
EWAR 
EWEH 
DWER 



MLAL 
MLAL 
MLTL 
MLAL 
MLAL 
MLAL 
MIAT 
QKET 
LCRR 



a2 



4 5 0 



460 470 480 



a3 



490 500 510 



H/D-MS 23% 

a4 

SLQJLQJLQJIQJLQJI 

5 2 0 



tarentolae 
. aethiopica 
.braziliensis 

donovani 
. infantum 

major 

. f asciculata 
brucei 
. cruzei 



NPERKDF 
NPERKDF 
NPERKDF 
NPERKDF 
NPERKDF 
NPERKDF 
APERTDF 
NPTRTRF 
DPERTKF 



LWKNQSEMNSA 
LWKNQSEMN3A 
LWKNRSEMNSA 
LWKNQSEMNSA 
LWRNQSEMNSA 
LWKNQSEMNSA 
LWKNKS PMNTA 
SWNNTSAMNrA 
TWNNSSAMNAA 



FFDL 
FFDL 
FFDL 
FFDL 
FFDL 
FFDL 
FFDL 
FFDL 
FFDL 



CEVGKQ 
CEVGKQ 
CEVGKQ 
CEVGKQ 
CEVGKQ 
CEVGKQ 
CEVAKQ 
CDVAKQ 
CEVAKQ 



VMLGL 
VMLGL 
VMLGL 
VMLGL 
VMLGL 
VMLGL 
VMLGL 
LMCGA 
MMIGL 



LGKE 
LGKE 
I D K E 
LGKE 
LGKE 
LGKE 
LDKE 
FGDR 
LNKE 



VALPKE 
VALPKE 
AALPKE 
AALPKE 
AALPKE 
VALPKE 
TATPTE 
EVNKKE 
TPSSAE 



EQAFWI 
EQAFWT 
EQAFWT 
EQAFWT 
EQAFWT 
EQAFWT 
ERHFWS 
EQSFWG 
NHSFWI 



MYAV 
MYAV 
MYAV 
MYAV 
MYAV 
MYAV 
VYAA 
MFAA 
LFAA 



HLSAACAE 
HLNTACAE 
HLSAACAE 
HLNAACAE 
HLNAACAE 
HLNAACAE 
HLHRACAE 
HLDNAC I N 
HLNYACTT 



ELHM 
ELHM 
ELHM 
ELNM 
ELNM 
ELHM 
RLMM 
EIGM 
ENGM 



PEVAMS 
PHVAMS 
PHDAMS 
PHVAMS 
PHVAMS 
PHVAMS 
PEEAMS 
LQGSMG 
PRDAVG 



LRKLNVKLKDFNFG 
LRKLNVKLKDFNFG 
RKLNVKLKDFNFG 
: RKLNVKLKDFNFG 
I RKLNVKLKDFNFG 
LRKLNVKLKDFNFG 
LRKLNVKLKDFSFG 
MHKLNVKLKD :'::FG 
MHKLNVKLKDFHFG 



L. tarentolae 



disordered 

5 3 0 



H/D-MS 12% 



5 4 0 



H/D-MS 24% 



A A A A* 

H/D-MS 30% 



a5 



5 5 0 



5 6 0 



5 7 0 



5 8 0 



6 0 0 



6 10 



tarentolae 

aethiopica 

braziliensis 

donovani 

infantum 

major 

f asciculata 
brucei 



T.cruzei 



gtryfkdmppeekkrrmerkqrieearrhgmps 
gtryfkdmppeeqkrrmerkqrieearrhgmss 
gtryfkdmppeeqqrrmerkqrieearrhgmt 
gtryrkdmppeeqkrrmerkqrieearrhgmss 
gtryfkdmppeeqkrrmerkqr1eearrhgmss 
gtryi'kdmppeeqkrrmerkqrieearrhgmss 
gtryfkdmpveeqerrvarkasieearrrstaa 
gtry:kdmppeeqerrrrrrleieqarrr.ap 
gtry:kdmppeeqerrlerkkrieearrrgs 



GSHEKRANWLTNDSFDYQTEDCVIDYAQHKWV 
GAHEKRANWLTNDSFDYQTEDCVVDYAQHKWP 
. GAHEKRANWLTNDSFDYQTEDCVVDYAKHKWV 
GAHEKRANWLTND S FDYQTEDC WD YAQHKWV 
GAHEKRANWLTNDSFDYQTEDCVFDYAQHKWV 
GAHEKRANWLTNDS FDYQTEDC VVDYAQHKWP 
KDGEQRSNWLTND AFDYQTEDCE VDYAGHGWA 
ICDSES . GDWLRNEAFDYQTEDVAVNYEREQWI 
AHETHT . DNWLLNDTFDYQQEDRKVEFEENGWM 



LPALHAKEVTKTVR . .TGEL 
PPALHAREITKNVR . .TGEL 
LXERHAKAVTKNVH . . TAW L 
PPALHAREITKNVR. . NGE L 
PPALHAREITKNVR. . SGEL 
PPALHAREITKNVR. .TGEL 
VPKQHAKTVTANVH . . QEAV 
TPENNARRFGFPERGVYGAE 
TPEAYVRHLGLKPCGDVTAA 
H/D-MS 11% 



Figure 2. Identification of the DB-JBP1 domain. (A) SDS-PAGE separation before and after limited proteolysis of the C/ : JBP1 protein alone and 
bound to J-DNA; the JBP1 and DB-JBP1 bands recovered after elution from the gel after the retardation assay are also shown and DB-JBPl-lis 
indicated by arrows. (B) Agarose gel DNA mobility retardation assay for JBP1 mixed with J-DNA before and after limited proteolysis, stained with 
ethidium bromide and with Coomassie blue; DB-JBP1 is indicated by arrows. (C) A schematic drawing of the JBP1 sequence with the N-terminal 
thymidine hydroxylase domain and the DB-JBP1 domain indicated. (D) A multiple sequence alignment of JBP1 from Leishmania, Trypanosoma and 
Crithidia; the DB-JBP1 domain is boxed; fully conserved residues are in black and bold on a gray background; conserved residues are in gray and 
bold; the secondary structure elements and residue numbers correspond to the L. tarantolae protein; disordered regions and peptides identified to be 
in contact with DNA in the HDX-MS experiment are annotated with labeled bars; wedge indicate Lys/Arg mutations and asterisks indicate D/E 
mutations (gray for no effect, black for affecting DNA binding). 



not connected by the sharp turn characteristic of HTH 
motifs, but by an unusually extensive loop of nine 
residues. The conserved 'small-hydrophobic-small' signa- 
ture motif in the turn between helix-2 and helix-3 of the 



core HTH structure (48) is conserved in the long connect- 
ing loop of the JBP1 family and is directly preceding the 
recognition helix (Ala-Met-Ser in Leishmania and 
Crithidia and Ser-Met-Gly or Ala-Val-Gly in the 
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Trypanosoma proteins). The 'charged-hydrophobic-small' 
motif in the supporting helix of HTH domains is absent in 
JBP1. 

Searches for structural homologs of DB-JBP1 in the 
PDB, using either SSM (40) or DALI (41), returned a 
significant, albeit weak, structural similarity with the 
HTH-domain structure of the MogR transcriptional re- 
pressor of the bacterial pathogen Listeria monocytogenes 
(49) (Figure 3E and Supplementary Figure S7). The 
MogR structure has been determined in complex with 
AT-rich DNA, with the recognition helix mediating 
major groove contacts as expected. The long C-terminal 
wing that mediates minor groove interaction with the 
AT-rich DNA in MogR, is in DB-JBP1 the 6-residue dis- 
ordered loop leading to the a5 helix that is absent in 




tri-helical tetra-helical DB-JBP1 



bundle bundle helical bouquet 

Figure 3. The structure of DB-JBP1. (A) Cartoon representation of 
DB-JBP1, in the orientation reminiscent of a flower bouquet; each 
helix is in a different color and labeled; the recognition helix is in 
yellow. (B) The same cartoon after a 90° rotation along the vertical 
axis. (C and D) Surface representations (in the same orientation as A 
and B respectively) colored according to the electrostatic surface po- 
tential (—7 to 7 kT). (E) Schematic drawing of typical HTH domains, 
DB-JBP1 and MogR, with helices drawn as cylinders; structurally 
equivalent helices are drawn in the same colors used for the structure 
of DB-JBP1. 



MogR. By analogy to MogR, this disordered loop likely 
gets ordered when DB-JBP1 gets in contact with DNA 
and could mediate minor groove interactions. 

The electrostatic surface potential of DB-JBP1 shows a 
large positive patch in the surface where the recognition 
helix lies, also suggesting that this is the primary area for 
DNA binding (Figure 3C and D). Given, however, the low 
sequence conservation (6% identity) and the large differ- 
ences between superposed Ca atoms (3.2 A) between 
DB-JBP1 and MogR, its closest HTH domain structural 
homolog, it was impossible to reliably predict in fur- 
ther detail the mode of JBP1 binding to DNA and 
explain the preference for J-DNA. We therefore used 
other methods to analyze the contacts of J-DNA with 
the JBP1 protein. 

HDX MS demonstrates the DB-JBP1 regions involved in 
DNA-binding 

We used HDX-MS to determine which regions of 
full-length JBP1 interact with DNA. The HDX-MS 
experiment measures the rate of exchange of protein 
backbone amide hydrogen with deuterium. The rate 
of this exchange is influenced primarily by hydro- 
gen bonding and therefore operates as a sensor for the 
local chemical environment of the amide hydrogen. 
The deuterium exchange is plotted as a function of 
time for the protein alone and is then compared to the 
values obtained for protein bound to DNA. The 
peptides that show reduced exchange rates upon 
complex formation with DNA are, therefore, involved in 
DNA binding, either directly or through allosteric 
perturbation of the protein structure/dynamics. We show 
that only a few peptides exhibit reduced H/D exchange 
rates upon binding of JBP1 to J-DNA, all spanning 
the DB-JBP1 domain (Figures 2C, 4A and B 
and Supplementary File SI for detailed results). The 
most prominent changes occurred in the peptides 
516-525 (helix a4, 30% reduction in exchange rate, 
Figure 4A). The other peptides that showed a significant 
change upon J-DNA binding, 434-^141 (helix ocl, 23%), 
510-515 (the loop between helices a3 and a4, 124%) and 
452^163 (helix a2 and the loop before, 12%) 
(Supplementary Figure S8), are also within the DB-JBP1 
domain. The peptide spanning residues 593-605 that 
showed a modest change (11%, Supplementary Figure 
S8) is just C-terminal to DB-JBP1. Eight different 
peptides that were identified in the HDX-MS experiment 
corresponding to the N-terminal half of the supporting 
helix a3 (~484-494), which often mediates some 
contacts with DNA, did not show any significant 
changes (—2% to 2%). A few more changes, all within 
the DB-JBP1 domain, had values below 10%, while no 
change above 5% was observed elsewhere in JBP1 
(Supplementary Figure S9). 

This experiment suggests that DB-JBP1 is the only JBP1 
domain involved in J-DNA binding and that it is sufficient 
for specific J-DNA recognition. The biggest change in 
hydrogen-deuterium exchange rates upon J-DNA 
binding overlaps perfectly with helix a4, confirming that 
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Figure 4. Mapping the DB-JBP1 regions that bind DNA. (A) The peptides that show hydrogen-deuterium exchange rate differences upon J-DNA 
binding are colored according to the magnitude of the differences and mapped in the DB-JBP1 structure; the insert depicts the change in hydrogen- 
deuterium exchange rate of the JBP1 peptides 516-525 (which corresponds to the recognition helix tn4) in the unbound and J-DNA bound state. 
(B) Location of the DB-JBP1 point mutants used to identify the residues involved in J-DNA binding. (C) Binding of varying concentrations of the 
various Arg/Lys mutants of DB-JBP1 to J-DNA (left) and T-DNA (right); colors are used purely to indicate the different curves more clearly. 
(D) Binding of DB-JBP1 and (E) JBP1 and the corresponding D525A mutants to T-DNA and to J-DNA. 
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residues in this helix are the primary site for DNA 
interaction. 

DB-JBP1 recognizes DNA through lysine and arginine 
residues in helix a4 

To find the residues that interact with the DNA backbone 
we created seven point mutants in DB-JBP1 (Figure 4B), 
based on structural data and the HDX-MS experiments: 
R517A, K518A, K522A, K524A targeting each positively 
charged residues in helix a4; R532A and K535A targeting 
the intrinsically disordered loop downstream helix a4 
which had an appreciable 8% decrease in the HDX-MS 
data and K436A lying at the end of helix al that also 
showed some changes in the HDX-MS data. All mutant 
proteins were expressed and purified and were used in the 
FP assay (Figure 4C). The R532A, K518, K522A and to a 
lesser extent K524A proteins, showed decreased affinity 
for both J-DNA and T-DNA, suggesting that these 
residues, all but one lying in the a4 ribbon helix, are 
involved in DNA binding, but do not specifically recog- 
nize the J-base. K436A and K535A showed no appreciable 
change in affinity. 

A single aspartate residue is responsible for J-binding 
specificity 

The glucose of the J-base lies in the major groove of the 
DNA and should be recognized specifically by DB-JBP1 
residues. We hypothesized that aspartate or glutamate 
residues could recognize the hydroxyl groups of the 
glucose moiety and these should be fully conserved in 
the JBP1 family. Ten such residues were located and we 
created eight point mutants (since there are two pairs of 
glutamates): E391A, D465A, E468A, E487/488A, D525A, 
D536A, E540/541A, E553A. Five of these mutants 
produced soluble protein in E. coli and were assayed in 
a SPR experiment in comparison with wt DB-JBP1. 
The D525A mutant protein exhibited no observable 
binding to J-DNA under the SPR experimental condi- 
tions, while all other mutants showed no significant 
differences compared to DB-JBP1 (Supplementary 
Figure S10 and Table SI). 

The D525A DB-JBP1 mutant was characterized in more 
detail in the FP assay. This mutant protein has strikingly 
lost nearly all discrimination for J-DNA, since its affinity 
for J-DNA is only 5.1 uM, similar to its affinity for 
T-DNA, 9.7 uM (Figure 4D and Supplementary Table 
SI). The affinity of the D5252A mutant for T-DNA 
(9.7 uM) has increased about 10 times compared to 
wild-type DB-JBP1 (~83 uM); a possible explanation for 
this observation is the removal of the repulsion between 
the negatively charged aspartate carboxylates and the 
DNA backbone phosphates. Consistently, the full-length 
JBP1 D525A mutant has the same low affinity for J-DNA 
(5.1 uM) and its affinity for T-DNA is 1 1.7 uM (Figure 4E 
and Supplementary Table SI). These results strongly 
indicate that a single residue, Asp525, is necessary and 
sufficient for conferring specificity toward J-DNA. 



Aspartate 525 is important for in vivo function of JBP1 

We tested whether the JBP1 D525A mutant can function- 
ally replace wild-type JBP1 in L. tarentolae, using an 
approach we previously described (18) and outlined in 
Supplementary Figure S2. Leishmania tarentolae cells in 
which one of the two JBP1 alleles had been inactivated by 
insertion of a drug resistance marker were transfected with 
plasmids expressing either wild-type or mutant JBP1. 
Then, the second chromosomal JBP1 allele was 
inactivated with a targeting construct containing a 
second drug resistance marker. Whereas both alleles of 
chromosomal JBP1 could be easily disrupted in the 
presence of the rescue plasmid containing wild-type 
JBP1, JBP1 D525A completely failed to rescue. Clones 
were obtained containing both drug resistance markers, 
but in these clones an extra copy of the wild-type gene 
had appeared (Supplementary Figure S3). Generation of 
additional wild-type gene copies is the standard response 
of Leishmania, if attempts are made to disrupt both alleles 
of an essential gene (16,50). The negative result with the 
mutant JBP1 is not due to defective synthesis or routing of 
the mutant protein, which is made in high amounts 
(Supplementary Figure S4). The protein is reaching the 
nucleus, as our anti-JBPl antibody detects more JBP1 in 
the nucleus of cells containing the rescue plasmid with 
mutant JBP1 than with wild-type JBP1 (Supplementary 
Figure S4). We conclude that mutant JBP1 is unable to 
provide sufficient JBP1 function for L. tarentolae to 
survive. 

Interestingly, the distribution of wild-type and D525A 
mutant JBP1 in the nucleus is different. Wild-type JBP1 is 
located in discrete spots (Figure 5), presumably the J-rich 
telomeres of L. tarentolae, which are known to be present 
in about 16 clusters in Leishmania (51). In contrast, 
JBP1-D525A exhibits a more diffuse nuclear staining 
pattern (Figure 5) indicating that it has lost its ability to 
specifically bind J-DNA also in vivo. 

A structural model for DB-JBP1 binding to J-DNA and 
validation by SAXS 

In the absence of a crystal structure of DB-JBP1 in 
complex with J-DNA, we created a model based on the 
structure of MogR with DNA (49), using interactive 
graphics (see Experimental Methods). The modeled 
J-DNA runs along the positive patch in JBP1 
(Figure 6A) and along the patch identified by the 
HDX-MS experiments (Figure 6B). In addition, Asp525 
is in contact with the J-base, while the Arg/Lys residues 
important for general DNA recognition are in contact 
with the J-DNA backbone (Figure 6C). 

To validate this model, we measured SAXS curves for 
DB-JBP1 alone and in complex with J-DNA in solution, 
to determine their low-resolution shape. The radius of 
gyration calculated from the SAXS data for the 
DB-JBP1 in complex with J-DNA (2.3 nm) fits well with 
that calculated from our model (2.15 nm). In addition, the 
calculated scattering curves from the structure of 
DB-JBP1 and from our model of the complex between 
DB-JBP1 and J-DNA, both fit very well only to their cor- 
responding experimental SAXS data (Figure 6D and 
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Figure 5. The D525A JBP1 mutant protein fails to localize in J-rich nuclear regions. Fluorescence microscopy images of L. tarentolae cells fixed on 
microscope slides and stained with anti-JBPI antibody and TOPro3 (DNA). JBP1 in the wild-type cells (upper panel) localizes in discrete spots in the 
nucleus, but the cells producing also the D525A mutant JBP1 (lower panel) show a more diffuse pattern. The bright staining DNA ovals in the cells 
are the kinetoplastid DNA networks; the nuclei stain less intense. The 'doughnut-like' staining with anti-JBPI antibody in the lower panel is 
probably due to weaker staining of the large nucleolus, creating an apparent hole in the nucleus. 



Supplementary Figure Sll). Furthermore, a low reso- 
lution ab initio beads model created based on the 
complex SAXS data is also in good agreement with our 
model (Figure 6D). 



DISCUSSION 

Under physiological conditions, JBP1 shows a remarkable 
10000-fold binding preference for J-DNA over T-DNA 
with the same sequence. This is in line with the results of 
Sabatini et al. (15) who showed that a 500-fold excess of 
T-DNA was unable to out-compete binding of J-DNA to 
JBP1. In salt concentrations lower than physiological, this 
specificity is compromised, explaining previous reports 
that the preference for J-DNA is only 100-fold (26). 

We show that the DB-JBP1 domain, which has affinity 
and specificity close to that of full-length JBP1, is the only 
region that mediates binding of JBP1 to J-DNA. The two 
functions of JBP1 are therefore separated in two different 
parts of the protein: the thymidine hydroxylase resides in 
the N-terminal half and the J-DNA binding in the 
C-terminal half. Cross et al. (14) raised the possibility 
that a HTH motif located in the N-terminal half of 
JBP1 (residues ~250-260), would be involved in DNA 
binding, as this motif had some similarity with the Myb 
homeodomain. This hypothetical binding site is not 
corroborated by our experiments. 

The DB-JBP1 fold is a divergent variant of the HTH 
domains family, a motif that can be traced back in 6-9 
copies, with divergent functionality in the last universal 
common ancestor (LUCA), and it is a common module 
in many proteins and multiprotein assemblies (48). The 
HTH domain in DB-JBP1 is characterized by an unusual- 
ly long 9-residue 'turn' and the long helices that bracket 
the core three-helical bundle. This helical bouquet is not 
readily classifiable in the HTH families described. This 



raises puzzling questions as to its ancestry and incorpor- 
ation into the JBP1 multidomain scaffold. 

The recognition helix of the HTH motif, the ribbon of 
the helical bouquet, bears a single aspartate residue 
(Asp-525) that appears to be the sole determinant of 
J-DNA versus T-DNA recognition. It has been suggested 
(26) that the C2 and C3 hydroxy groups of the J-base 
glucose form hydrogen bonds with the non-bridging 
pro-R phosphoryl oxygen of the J-l nucleotide phosphate 
group. In this model, the C4 and C6 hydroxy groups of 
the glucose moiety are free to engage in additional 
contacts. We speculate that Asp-525 is recognizing one 
or both of the C4 or C6 hydroxy groups. 

Our model does not point to any direct binding con- 
tacts between JBP1 and the pyrimidine heterocycle of 
base J. However, it has been shown that replacement 
of the hydroxymethyluracil of base J with hydroxy- 
methylcytosine reduces binding to JBP1 ~15 times (27). 
This result argues that JBP1 directly recognizes the base 
heterocycle, putting our model at apparent odds with the 
experimental data. However, chemical intuition suggests 
that a potential intramolecular hydrogen bond can be 
formed between the C4 amino group of the cytosine 
heterocycle and the C6 hydroxy group of the pendant 
(3-D-glucosyl, an interaction that cannot be established in 
the native base J. This hydrogen bond would result in a 
change in the overall geometry of the 'edge on' conform- 
ation of the (3-D-glucose group in the major groove, a 
critical determinant of JBP1 binding (26). Therefore, in 
the specific case of hydroxymethylcytosine, the reduction 
in JBP1 binding can be accounted for without invoking 
any recognition of the heterocycle by JBP1. The binding 
of JBP1 to in vitro glucosylated hydroxymethylcytosine- 
DN A is of special interest, as it has recently been used in a 
new and efficient method, to isolate DNA containing 
hydroxymethylcytosine (52). This should help to locate 



5726 Nucleic Acids Research, 2011, Vol. 39, No. 13 




0.01 -I , . . . , . 

0.0 0.05 0.10 0.15 0.20 0.25 0.30 

s (nm-'j 

Figure 6. A model of JBP1 binding to J-DNA. (A) JBP1 is shown as a 
surface colored by electrostatic potential; J-DNA is shown as a cartoon 
with the J-base as stock model; while the DNA backbone runs along 
the positive (blue) patch in the JBP1 surface, the J-base faces a negative 
(red) 'island'. (B) Same as in A but the JBP1 surface is colored by the 
propensity of peptides to interact with J-DNA according to the 
HDX-MS experiments, like in Figure 4A. (C) A close up of the inter- 
action of JBP1 with the J-base; the residues important for interaction 
according to mutagenesis studies are shown as stick models and are 
labeled. (D) Validation of the model against SAXS data; the graph 
shows the intensity calculated from the model (red line) and the experi- 
mental data (open cyan circles) against the scattering amplitude; in the 
inlet picture the pink surface represent an ah initio beads model 
calculated from the SAXS data and superposed onto the 
JBP1J-DNA complex model. 



this new mammalian epigenetic marker base (24,53) more 
precisely in DNA. 

The D525A mutant of JBP1, in contrast to wild-type 
JBP1, fails to rescue the targeted JBP1 gene deletion in 
L. tarentolae. Previous work with T.brucei has shown that 
introduction of wild-type JBP1 into trypanosomes without 
any J in their DNA results in J synthesis (10). JBP1 is, 
therefore, able to start from scratch, i.e. hydroxylate some 
T-residues in T-DNA, to which it binds poorly. Once a 
few initial J molecules have been made, hydroxylation can 
apparently accelerate sufficiently to introduce substantial 
amounts of J into the DNA (10). In contrast, with the 
D525A mutant of JBP1 the sluggish introduction of 
some J is evidently not followed by a rapid increase in J, 
because mutant JBP1 is unable to bind strongly and spe- 
cifically to J and hydroxylate more T's in an efficient 
fashion. Highly specific J-binding is therefore an essential 
prerequisite for JBP1 function. From previous experi- 
ments (54), we have deduced that even 10-15% of 
wild-type JBP1 function is sufficient for normal multipli- 
cation of L. tarentolae: it follows that D525A JBP1 
provides less than 10% of wild-type function and that 
DNA binding alone cannot restore the complete JBP1 
function without the specific recognition and strong 
binding of JBP1 to pre-existing J. 

Having established that Asp525 is a residue crucial for 
JBP1 function both in vitro and in vivo, we constructed a 
model for JBP1 binding to J-DNA, where the ribbon helix 
is positioned along the major groove of J-DNA. The 
low-resolution shape of our model is consistent with 
solution X-ray scattering experiments. Although the 
atomic interactions in this model are in agreement with 
all our mutagenesis data, the details remain to be struc- 
turally validated. 

JBP1 was assumed to have a 'glucose-binding pocket' 
that could be targeted by potential small molecule inhibi- 
tors. Unfortunately, our model renders this possibility less 
likely, since the glucose is recognized by the aspartate 
residue protruding from the 'ribbon 1 helix, much in the 
same manner as other helical major groove interactions 
in DNA recognition. Such interactions have been notori- 
ously difficult to target by small-molecule inhibitors. 
However, the specific charge pattern, with the acidic as- 
partate surrounded by basic residues, could provide a 
possible target site and a positive surprise. In addition, 
small molecules or short synthetic peptide inhibitors 
based on the DB-JBP1 recognition helix, targeting 
J-DNA rather than JBP1, or chemicals targeting the 
J-DNA conformation as previously proposed (26), might 
present a better suited therapeutic strategy. 
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